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ABSTRACT 

Friedreich's ataxia (FRDA) is caused by biallelic ex- 
pansion of GAA repeats leading to the transcription- 
al silencing of the frataxin (FXN) gene. The exact 
molecular mechanism of inhibition of FXN expres- 
sion is unclear. Herein, we analyze the effects of 
hyperexpanded GAA repeats on transcription status 
and chromatin modifications proximal and distal to 
the GAA repeats. Using chromatin immunopre- 
cipitation and quantitative PCR we detected signifi- 
cant changes in the chromatin landscape in FRDA 
cells relative to control cells downstream of the pro- 
moter, especially in the vicinity of the GAA tract. In 
this region, hyperexpanded GAAs induced a particu- 
lar constellation of histone modifications typically 
associated with heterochromatin-like structures. 
Similar epigenetic changes were observed in GFP 
reporter construct containing 560 GAA repeats. 
Furthermore, we observed similar levels of FXN 
pre-mRNA at a region upstream of hyperexpanded 
GAA repeats in FRDA and control cells, indicating 
similar efficiency of transcription initiation. We also 
demonstrated that histone modifications associated 
with hyperexpanded GAA repeats are independent 
of initiation and progression of transcription. Our 
data provide strong evidence that FXN deficiency in 
FRDA patients results from a block of transition 
from initiation to a productive elongation of FXN 
transcription due to heterochromatin-like struc- 
tures formed in the proximity of the hyperexpanded 
GAAs. 



INTRODUCTION 

Friedreich's ataxia (FRDA) is an inherited degenerative 
disease that is characterized by progressive ataxia, inclu- 
ding uncoordinated gait and limb movements, weakened 
muscle strength, and diminished senses of position and 
vibration. FRDA is caused by an insufficient level of 
Frataxin (FXN) (1,2). FXN is an evolutionarily conserved 
mitochondrial protein that is involved in iron homeostasis 
in cells (3). Reduced levels of the FXN gene expression in 
FRDA patients are caused by a hyperexpanded tract of 
repeated GAA triplets in intron 1 of the FXN gene (4,5). 
In FRDA patients, the GAA tract frequently consists 
of > 1000 triplets, whereas unaffected individuals have 66 
or fewer repeats at the FXN gene (4). Pathological expan- 
sion of the GAA repeats is associated with localized chro- 
matin changes and transcriptional silencing at the gene; 
however, the underlying molecular mechanisms of 
hyperexpanded GAA-induced transcriptional defects are 
not yet clear. 

The hyperexpanded GAA repeats at the FXN gene have 
been reported to adopt a heterochromatin-like structure 
that is characterized by high levels of di- and tri- 
methylated lysine 9 of histone H3 (H3K9me2/3) and un- 
deracetylated H3 and H4 (6-9). Inhibitors of histone 
deacetylases increase levels of FXN expression in FRDA 
primary lymphocytes and in a murine model (8,10). 
Additionally, altering histone modifications, especially levels 
of acetylation, can partially reactivate expression of the FXN 
gene. Thus, the results of these studies suggest that changes 
in chromatin structure upstream of the hyperexpanded 
GAA repeats induce FXN silencing. However, it is not 
clear whether the heterochromatin-like structure induced 
by the hyperexpanded tract of GAA repeats impacts 
initiation and/or elongation of FXN transcription. 
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Some studies indicate that the heterochromatin-like con- 
formation induced by the hyperexpanded GAA repeats 
extend to the promoter region and affect initiation of 
FXN transcription (11,12). Repressive marks such as 
H3K27me3 and H3K9me3, as well as heterochromatin 
protein (HP1) are enriched at the transcription start site 
(TSS) of the FXN gene in FRDA fibroblast lines, which 
results in the failure of FXN transcription initiation. These 
heterochromatin marks may also affect expression of anti- 
sense transcripts at the region upstream of the FXN TSS, 
thereby interfering with initiation of FXN sense transcripts 
in FRDA patients (11). Other studies suggest that FXN 
deficiency results not only from defective initiation, but 
also transcript elongation (12). Levels of both H3K4me3 
at the TSS (represents active transcription initiation) of 
the FXN gene and H3K36me3 (an indicator of transcrip- 
tion elongation) are decreased at the FXN gene in FRDA 
cell lines. 

Epigenetic changes induced by the hyperexpanded 
GAA repeats are one of the primary therapeutic targets 
in FRDA. A number of studies have demonstrated that 
specific histone deacetylase inhibitors (HDACi) are 
capable of enhancing histone acetylation and FXN expres- 
sion in FRDA cells (8-10). On the contrary, a repressive 
mark, H3K9me3, observed in the proximity of the long 
GAA repeats is sustained during HDACi treatment (8,10). 
Additionally, inhibition of H3K9 methylation with BIX- 
01294 has no effect on FXN expression in FRDA cells 
(17). These results suggest that simultaneous targeting of 
two or more epigenetic silencing pathways may be required 
to restore full activity of the FXN gene. Therefore, a 
detailed definition of the landscape of histone modifica- 
tions associated with hyperexpanded GAA repeats is ne- 
cessary in order to further understand the molecular 
mechanisms underlying chromatin changes in FRDA 
cells and their relationship to FXN deficiency. 

While previous studies have examined selected histone 
modifications at the FXN gene, each study was limited in 
either the scope of the modifications examined or the 
number of cell lines analyzed. Here we report a compre- 
hensive analysis of histone modification patterns in mul- 
tiple lymphoid cell lines derived from FRDA patients and 
unaffected controls. We found that the hyperexpanded 
GAA repeats affect chromatin structure in the proximity 
of the GAA repeats, but do not extend to the promoter. In 
contrast, the distribution of RNA pol II and histone 
methylation marks associated with transcription elong- 
ation were underrepresented in the presence of the 
hyperexpanded GAA repeats at the FXN gene in FRDA 
cell lines. These results strongly support the hypothesis 
that the transcription machinery correctly initiates synthe- 
sis of the FXN mRNA. Interestingly, RNA pol II is 
enriched and appears to be paused at the FXN 
promoter-proximal region in unaffected cell lines, but 
not FRDA cells. We also found that altered enrichment 
of H3K4me3 and H3K79me2 at the FXN gene reflects 
defective transcription at a post-initiation step. Taken 
together, our study provides strong evidence that FXN 
deficiency in FRDA patients results from a block in the 
transition between initiation and elongation of FXN tran- 
scription. This deficiency is possibly attributed to 



heterochromatin-like structures formed in the proximity 
of the hyperexpanded GAAs that act as an obstacle to 
the elongation of FXN pre-mRNA. 

MATERIALS AND METHODS 

Cell lines 

Lymphoid cell lines were purchased from NIGMS Human 
Genetic Cell Repository at The Coriell Institute for 
Medical Research, Camden, NJ, USA. We selected three 
cell lines derived from FRDA patients: GM 15850 harbor- 
ing two expanded alleles of approximately 650 and 1030 
GAA repeats; GM 16798 harboring two expanded alleles 
of approximately 750 and 1000 GAA repeats and 
GM 16209 harboring approximately 800 GAA repeats on 
both alleles of the FXN gene. As controls we also used 
three lymphoid cell lines (GM 15851, GM03928 and 
GM05152) derived from healthy, unaffected individuals 
containing short GAA repeat tracts within a normal 
range. Lymphoid cell lines were grown in RPMI 1640 
medium supplemented with 15% FBS, lOOU/ml penicillin 
and lOOug/ml streptomycin at 37°C in 5% C0 2 . The 
GFP GAA systems [GFP_(GAA-TTC) 0 and GFP_ 
(GAA •TTQseo] were generated by integration of a tract 
of GAA repeats derived from the GM16210 affected cells 
as described previously (9). A DNA fragment harboring 
approximately 560 GAA repeats was PCR amplified from 
genomic DNA isolated from the GM16210 cell line 
(NIGMS Human Genetic Cell Repository at The Coriell 
Institute for Medical Research, Camden, NJ, USA) as 
described in (9). PCR product was cleaved by Bsu36I 
and BssHII endonucleases and the GAA repeat- 
containing fragment was cloned into intron 1 (1.2kb 
from the exonl /intron 1 junction) of the GFP gene of the 
pGFP Int plasmid (13,14). Plasmids containing 0 and 560 
GAA repeats were integrated by site-specific recombin- 
ation into the genome of the HEK293Flp-InT-Rex cell 
line (Invitrogen). The use of identical sites of integration 
for both GFP_(GAA-TTC) 0 and GFP_(GAA-TTC) 560 
constructs allows direct comparison between cell lines 
and eliminates any potential bias resulting from random 
integration events in different chromosomal contexts. 
Integrants were selected using hygromycin (200 ug/ml); in- 
dividual hygromycin-resistant colonies were isolated and 
analyzed for repeat size and GFP expression level. Correct 
splicing of the GFP mRNA was determined as described 
earlier (4). All constructs were sequenced prior to as well 
as after establishing the stable cell lines. The GFP_GAA 
lines were selectively maintained in Dulbecco's Modified 
Eagle's Medium with 10% FBS, lOOU/ml penicillin and 
lOOug/ml streptomycin at 37°C in 5% C0 2 supplemented 
with hygromycin (200 ng/ml) and blasticidin (5 ug/ml). To 
induce transcription of the GFP minigene in the 
GFP GAA system, cells were treated with 0.1 (ig/ml tetra- 
cycline for 24 h. 

Polymerase chain reaction 

The size of the hyperexpanded tract of GAA repeats in 
lymphoid cell lines were determined by conventional PCR 
using primers 2500F and 629 under previously described 
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conditions (4). The insertion of the GAA repeats at an 
intron of the GFP gene in the GFP_GAA system was 
detected by the conventional PCR using the primers 
listed in Table 1. The PCR products were analyzed on 
0.8% agarose gels. 

To perform quantitative reverse transcriptase-PCR 
(qRT-PCR), total RNAs from lymphoid cell lines and 
the GFP_GAA system were isolated using the RNeasy 
Mini Kit (Qiagen). DNase I (TURBO DNA-free; 
Ambion) was added to remove genomic DNA contamin- 
ation from isolated RNAs. The qRT-PCR was conducted 
using the Power SYBR Green RNA-to-C T 1-Step Kit 
(7500 Fast Real Time-PCR System, Applied 
Biosystems). As a control, reactions were also performed 
without reverse transcriptase to confirm a removal of 
genomic DNA. To calculate levels of FXN expression, 
the delta C t value was generated by subtraction of the 
C t value of GAPDH from the C t value of either FXN 
mRNA or four different region transcripts of each cell 
line. Levels of FXN mRNA from each cell line were 
normalized to the delta C t of GM15851. For levels of 
FXN pre-mRNA expression, each delta C t value of four 
different regions at the FXN gene was normalized to the 
delta C t value of the region upstream of the GAA repeat 
from GM 15851. All primers used in this study are listed in 
Tables 1 and 2. 

Chromatin immunoprecipitation 

Chromatin immunoprecipitation (ChIP) was performed 
according to the EZ ChIP instructions (Upstate 
Biotechnology Inc.). Briefly, proteins and DNA we re 



cross-linked with 1% formaldehyde for lOmin (15min 
for RNA pol II ChIP) at room temperature. The cross- 
linking reaction was quenched with 125mM glycine for 
5min (lOmin in the case of RNA pol II ChIP). Whole 
lysates were prepared using a cell lysis buffer (50 mM 
Tris-HCl at pH8.0, 10 mM EDTA and 1% SDS) supple- 
mented with protease inhibitor (Mini EDTA-free protease 
inhibitor cocktail, Roche) and sonicated to obtain 100 
300 bp DNA fragments using a Bioruptor Sonicator 
(Diagenode). The fragmented chromatin was diluted 10 
times with dilution buffer (16.7 mM Tris-HCl at pH 8.0, 
167mM NaCl, 1.2 mM EDTA, 1.1% Triton X-100 and 
0.01% SDS). The chromatin fragments were pre-cleared 
for 1 h using Protein A agarose (Millipore) pre-blocked 
with BSA and salmon sperm DNA. The equivalent of 
5 x 10 6 (1 x 10 7 cells in the case of RNA pol II ChIP) 
cells was immunoprecipitated overnight with 5 ug of anti- 
bodies recognizing histones and histone modifications 
(15 ug of RNA pol II antibody in the case of RNA pol 
II ChIP). The immunoprecipitates were immobilized on 
the pre-blocked protein A agarose beads for 1 h. 
Subsequently, the beads were washed with buffers con- 
taining low salt, high salt, and LiCl. Chromatin was 
eluted from the beads with elution buffer (lOOmM 
NaHC0 3 and 1% SDS) at room temperature for 15min 
twice and subsequently subjected to the reverse cross- 
linking reaction using 5M NaCl at 65° C for at least 5h. 
The DNAs from chromatin complexes were isolated with 
Tris-EDTA buffer (80 mM Tris-HCl at pH 6.5 and 
20 mM EDTA) supplemented with Proteinase K and 
RNase A at 37°C for 30min followed by 42°C for 1 h. 
DNA fragments were purified using phenol/chloroform 



Table 1. Primers used for analyses in the GFP_GAA system 



Primer (5'-3') 



Forward Reverse 



Repeat CTTCCCTTTACACAACGTTTGGGTT GTACTGTTTGGATTCAGTGAGGGACT 

Exl/Ex2 GCGACGTAAACGGCCACAAGTT ATGCCCTTCAGCTCGATGCGGT 

Exl/Int GACGACGGCAACTACAAGACC CTAGGACAAAGGTGCCTAAGACC 

Up AATAGCCTCCTGACCACAGATCCTT CCATGTGACATCTAGCCCCGCA 

Down CCCACAGGCCTGAAACACT TTCATGCGTGCTAGGGTAAA 

Int/Ex2 CCCTAGCACGCATGAACC ATGCCCTTCAGCTCGATGCGGT" 



"This primer is the same as the GFP Exl/Ex2 reverse primer. 
Table 2. Primers used for analyses in lymphoid cell lines 



Primer (5'-3') 



Forward Reverse 



Ex3/Ex4 
InlEx2 
-242 
Pro (or - 
+48 

Up (or + 
+1394 
Down 
In2Ex3 



133) 



1231) 



CCTTGCAGACAAGCCATACA 

AGCACTCGGTTACAGGCACT 

CGCATTTTATAAACAAGGCACA 

CCCCACATACCCAACTGCTG 

AAGCAGGCTCTCCATTTTTG 

GAAACCCAAAGAATGGCTGTG 

GGTACGCCGCATGTATTAGG 

CTGGAAAAATAGGCAAGTGTGG 

GGTAATCATGTTTTGGGTTTTGTGC 



GGTCCACTGGATGGAGAAGA 

GCCCAAAGTTCCAGATTTCC 

GTATGTGGGGCCAGGAGAC 

GCCCGCCGCTTCTAAAATTC 

CCGCAGGCACTCTTCTGT 

TTCCCTCCTCGTGAAACACC 

GCAACCAATCCCAAAGTTTC 

CAGGGGTGGAAGCCCAATAC 

AGTCCTCAAACGTGTATGGCTTGTC 



Number on the name of primers denotes the first base of forward primer on the target region with respect to the TSS, +1. 
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extraction before quantitative real-time PCR (qPCR). The 
qPCR was conducted using the Power SYBR Green-C T 
Kit (7500 Fast Real Time-PCR System, Applied 
Biosystems). The qPCR was carried out as follows: 
lOmin at 94°C, 50 cycles of 30 s at 94°C followed by 
60 s at 60°C. As changes in nucleosome occupancy occur 
upon gene activation and gene silencing (15), it is note- 
worthy that the relative abundance of histone modifica- 
tions (referred to as the 'ratio') determined by qPCR were 
analyzed by normalizing the quantity of the immunopre- 
cipitated sample to the quantity of total histone H3 (total 
H4 in the case of H4K20me3) after normalization with 
inputs. 

Antibodies 

The antibodies used in this study were: anti-rabbit IgG as 
a negative control (Cell Signaling), anti-total H3 (Cell 
Signaling or Active Motif), anti-total H4 (Abeam), anti- 
H3K9/14ac (Upstate), anti-H3K4me2 (Active Motif), 
anti-H3K4me3 (Abeam or Active Motif), anti-H3K9me3 
(Upstate or Active Motif), anti-H3K27me3 (Abeam), 
anti-H3K36me3 (Upstate), anti-H3K79me2 (Upstate or 
Active Motif), anti-H4K20me3 (Active Motif) and an 
antibody against the large subunit of RNA polymerase 
II (N20; Santa Cruz Biotechnology). 

Statistics 

A two-way analysis of variance followed by the 
Bonferroni post hoc test were performed to determine 
the statistical significance of the results of qRT-PCR 



and ChIP analysis between unaffected and FRDA cell 
lines. 

RESULTS 

Reduced level of the FXN pre-mRNA downstream of the 
hyperexpanded GAA tract in FRDA cell lines 

To confirm FXN insufficiency in FRDA cell lines 
(GM 15850, GM 16798 and GM 16209), FXN mRNA ex- 
pression was examined by qRT-PCR using a primer set 
recognizing exon 3 as a forward primer and exon 4 as a 
reverse primer (Figure IB). The level of the FXN mRNA 
expression varies significantly among FRDA patients. 
Affected individuals can express as little as 5% and as 
much as 35% of FXN relative to a control individual 
(16). In order to establish a common epigenetic profile 
of the FRDA individuals, we selected a representative 
cohort of FRDA cell lines based on the expression of 
the FXN mRNA. We selected cell lines expressing low 
(GM16798) medium (GM15850) and high (GM16209) 
levels of the FXN mRNA corresponding to ~8, 17 and 
29% of the amount of the FXN transcript found in the 
GM 15851 control cell line, respectively (Figure IB). 

In order to further determine the effect of the 
hyperexpanded GAA repeats on the progression of tran- 
scription throughout the FXN gene, we examined levels of 
FXN pre-mRNA upstream (from +1231 to +1344 within 
the intron 1) and in three regions downstream (variable 
positions, depending on repeat tract length) of the GAA 
repeats as schematized in Figure 1A. Importantly, all 
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Figure 1. Decreased level of FXN pre-mRNA downstream of the hyperexpanded GAA repeats. (A) Schematic diagram of the FXN gene. Black lines 
under the schematic diagram indicate regions amplified by designated primers used in this study. (B) FXN mRNA was analyzed in three control and 
three FRDA cell lines by qRT-PCR with primers complementary to the exons 3 and 4. Blue-series bars designate control cell lines, while Red-series 
bars designate FRDA cell lines. FXN expression was normalized to the expression level determined by control GM 15851 cells. The experiment was 
conducted in triplicate. All data are expressed as mean ± SD. (C) Relative level of the FXN pre-mRNA was determined in various regions of the 
transcript. 'Up' and 'Down' primers amplify regions in the immediate vicinity of the GAA repeats. InlEx2 and In2Ex3 anneal to the junctions 
between the corresponding introns and exons of the FXN pre-mRNA. The FXN pre-mRNA expression was normalized to the level of the fragment 
amplified upstream of the GAA region in the control GM15851 cells. The experiment was performed in triplicate. 
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primer pairs used in this experiment mapped within the 
pre-mRNA to avoid a bias from amplification of mature 
FXN transcript. 

The qRT-PCR analyses revealed similar levels of the 
FXN pre-mRNA at the region upstream of the 
hyperexpanded GAA repeats in all three unaffected 
control cell lines and two of the three FRDA lymphoid 
cell lines (Figure 1C). The GM 16798 FRDA cell line that 
exhibited a lower level of the FXN pre-mRNA upstream 
of the GAA repeats, as compared to the remaining five cell 
lines, also exhibited the lowest level of mature FXN RNA. 
In contrast to the similar levels of FXN pre-mRNA 
upstream of the GAA triplet repeats, levels of FXN 
pre-mRNA were reduced ~4-fold at regions downstream 
of the repeats in all three FRDA lymphoid cell lines 
relative to unaffected cell lines. 

We also conducted analyses of antisense transcription in 
the FXN locus using strand-specific RT-PCR to evaluate 
the relative contribution of sense and antisense transcripts 
from the FXN locus. In agreement with previous data (1 1), 
we were unable to detect antisense transcripts at the 
regions specified in Figure 1A, either upstream or down- 
stream of the GAA repeats (data not shown). Thus, all 
FXN transcripts measured in our analysis reflect sense 
RNA transcribed in the vicinity of the GAA tract. These 
results strongly suggest that the GAA repeats affect later 
steps of the transcription process that are downstream of 
initiation. 

RNA pol II distribution is influenced in the region 
upstream of the GAA repeats but not at the promoter in 
FRDA cell lines 

In an effort to further understand how the hyperexpanded 
tract of GAA repeats affects progression of FXN tran- 
scription, we analyzed the distribution of total RNA pol 
II at the FXN gene by ChIP assay using an antibody that 
recognizes both phosphorylated and unphosphorylated 



forms of the large subunit of RNA pol II (Figure 2). 
We used seven primer sets to assure the fine mapping 
of the distribution of RNA Pol II in the FXN gene. 
Based on prior studies demonstrating transcriptional 
pausing at the long GAA tracts in vitro, as well as in 
cell-culture systems (17,18), we expected that RNA pol 
II progression would be impeded in the region upstream 
of the hyperexpanded GAA repeats in FRDA cell 
lines, which would result in the accumulation of transcrip- 
tional machinery at the region upstream of the GAA 
repeats. However, we observed decreased levels of RNA 
pol II across the entire coding region of the FXN gene in 
FRDA cell lines compared to unaffected cell lines. 
Importantly, levels of RNA pol II were not changed 
in the FXN promoter region (from —242 to —12) in 
FRDA cell lines compared to control cell lines, suggest- 
ing that FXN promoter activity is not affected in FRDA 
cell lines. These results are in agreement with previ- 
ous findings that similar levels of FXN pre-mRNA 
at the region upstream of the GAA repeats are observed 
in both FRDA and unaffected control cell lines 
(Figure 1C). 

Interestingly, RNA pol II accumulated at a 
promoter-proximal region (from +48 to +150) in unaffect- 
ed cell lines. This was an unexpected finding as the length 
of the GAA repeats in these cells are short, ranging from 7 
to 20 repeats. The accumulation of total RNA pol II at the 
FXN promoter-proximal region is similar to what is 
observed in several other genes (19), where RNA pol II 
is known to pause. Pausing of RNA pol II was not 
detected in the FXN promoter-proximal region of the 
FRDA cell lines. 

Collectively, these results suggest that transcriptional 
defects in FRDA cells begin at a post-initiation step of 
FXN transcription, which leads to decreased occupancy 
of the transcriptional machinery at the promoter-proximal 
region, as well as throughout the FXN gene. 
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Figure 2. Distribution of RNA polymerase H is affected in the region upstream of GAA repeats in FRDA cells. Level of total RNA pol II was 
determined by immunoprecipitating the large subunit of RNA pol II in FRDA and control cells. Average occupancy of RNA pol II across the FXN 
gene in all three FRDA and all three control cells is shown using red and blue dots, respectively. The position of the first nucleotide for the forward 
primer relative to TSS is indicated below the X-axis. The specificity of RNA pol II antibody for ChIP was verified by RNA pol II pausing at the 
+55 bp of the GAPDH gene. Error bars represent standard error of the mean. The experiment was conducted in triplicate using three FRDA and 
three control cell lines. The P-value was generated by comparing the average percentage of input of total RNA pol II between unaffected and FRDA 
lymphoid cell lines. 
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Decreased levels of H3K4me2 and H3K4me3 at the 
region immediately upstream of the GAA repeats 
in FRDA cell lines 

We next examined distributions of H3K4me2 and 
H3K4me3 at the FXN gene, as these marks at the 
promoter are reflective of the transcriptional activity of 
a gene. Although FXN mRNA levels are known to vary 
considerably among FRDA individuals and unaffected 
controls (20), the majority of studies to date have used a 
single unaffected individual as a control to compare chro- 
matin modification changes at the FXN gene in FRDA 
patients (10,21). Here, we analyzed cells from three un- 
affected controls and three FRDA patients to delineate 
consistent and significant changes in chromatin structure 
associated with the hyperexpanded GAA repeats in the 
lymphoid cell lines. 

Although variations in H3K4me2 and H3K4me3 levels 
were observed within compared groups of cell lines, levels 
of these histone marks were not different at the promoter 
region (from —133 to —12) of the FXN gene in FRDA 
cell lines and unaffected controls (Figure 3, upper panels). 
These results are consistent with our quantitative pre- 
mRNA analyses showing that the region upstream of 
the GAA repeats is transcribed at the same level in both 
unaffected and FRDA cell lines. Interestingly, in all three 
FRDA cell lines levels of H3K4me2 and H3K4me3 were 
significantly lower at a region immediately upstream of 
the GAA repeats (from +1231 to +1344). In contrast, 



enrichment of H3K4me2 and H3K4me3 levels were the 
highest at this region in the unaffected cells, which 
correlated with the distribution of total RNA pol II at 
the FXN gene (Figure 2). Levels of H3K4me2 have been 
correlated with non-methylated CpG residues more tightly 
than with transcriptional activity (22,23). Indeed, high 
levels of DNA methylation have been reported in the 
region upstream of the GAA repeats in FRDA cell lines 
(24,25). Therefore, the decreased levels of H3K4me2 in 
FRDA cell lines could reflect increased DNA methylation 
specifically at this region. Previous reports have shown 
that levels of H3K4me3 in coding regions are associated 
with the efficiency of the post-initiation processes during 
active transcription (26). The low levels of H3K4me3 in 
FRDA cell imply that the hyperexpanded GAA repeats 
affect the transition between initiation and elongation of 
FXN transcription. Taken together, these results suggest 
that the tract of hyperexpanded GAA repeats do not 
affect chromatin structures at the FXN promoter region, 
but do affect levels of H3K4me2 and H3K4me3 in the 
vicinity of the GAA repeats, which indicates a defect- 
ive transition between initiation and elongation of FXN 
transcription in FRDA. 

Decreased methylation levels of H3K36 and H3K79 in 
FRDA cell lines 

To further understand how the hyperexpanded GAA 
repeats affect the transcription process throughout the 
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Figure 3. Transcription associated histone modifications are reduced in FRDA cell lines compared to unaffected controls. Transcription status at the 
FXN gene was determined by the distribution of histone modifications using the ChIP assay. DNA from chromatin immunoprecipitated using 
antibodies specific for the indicated histone modifications was subjected to qPCR using primers amplifying promoter region, fragments upstream 
(Up) and downstream (Down) of the GAA repeats and junction between intron 2 and exon 3 (In2Ex3). The experiment was conducted in triplicate 
using three FRDA and three control cell lines. Data are expressed as mean ± SEM. P-values were calculated by comparing the averages of the ratios 
between unaffected and FRDA lines. For all histone modification analyses shown in Figures 3-6, ChIP data are presented relative to input DNA and 
normalized to the total H3 (or H4 in the case of H4K20me3) in each region. 
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FXN gene, we measured the levels of H3K36me3 and 
H3K79me2, which are characteristic markers of transcrip- 
tion elongation. In unaffected controls, the level of 
H3K36me3 gradually increased toward the 3'-end of the 
FXN gene, consistent with normal RNA pol II progres- 
sion as shown in Figure 2. However, the level of 
H3K36me3 was reduced across the entire FXN gene in 
FRDA cell lines, indicating defective transcription elong- 
ation in FRDA cell lines. 

Recently, a genome-wide study revealed that full-length 
transcription is often characterized by co-enrichment of 
both H3K4me3 and H3K79me2 at regions downstream 
of the TSS (27). In contrast, enrichment of H3K4me3 
alone is characteristic of genes that undergo transcription 
initiation, but not elongation (28). We found high levels of 
H3K79me2 in regions upstream and downstream of the 
GAA repeats in the unaffected control cells, whereas levels 
of H3K79me2 were significantly decreased throughout the 
entire FXN gene in FRDA cells (Figure 3, bottom panel). 
Because levels of H3K4me3 were unchanged at the FXN 
promoter in FRDA cell lines (Figure 3, upper panel), these 
results support the initial observation (Figure 1) that 
FXN transcription is hampered at an elongation step 
after successful initiation. 

Enriched H3K9 methylation is accompanied by increase 
of H3K27 and H4K20 methylation at the FXN gene 
harboring the hyperexpanded GAA repeats 

Other groups have reported decreased histone acetyl- 
ation and increased methylation of H3K9 in the regions 
flanking the hyperexpanded tract of GAA repeats in 
FRDA cells, patient autopsy tissues and mouse FRDA 



models (7,10,11,21,24). We confirmed that the levels 
of H3K9/14ac were significantly decreased at the FXN 
gene, specifically at the region upstream of the GAA 
repeats, in FRDA cells when compared with unaffected 
cells (Figure 4, upper panel). In contrast to previous find- 
ings, in the promoter region of the FXN gene, the levels 
of H3K9/14ac were similar in both groups. Consistent 
with the results of H3K4me3, RNA Pol II and FXN 
pre-mRNA expression analyses, these data indicate that 
the hyperexpanded GAA repeats do not influence 
promoter activity. 

We also confirmed that H3K9me3 was greatly enriched 
in the region upstream of the GAA repeats in FRDA cell 
lines (Figure 4, upper panels). Although the majority of 
H3K9me3 is associated with repressed and silenced 
regions throughout the genome, this histone modification 
is also found in the coding regions of transcriptionally 
active genes (29). In centromeric heterochromatin, where 
gene expression is repressed, enrichment of H4K20me3 is 
accompanied by enrichment of H3K9me3 (30,31). To as- 
certain whether high levels of H3K9me at the FXN gene 
represent a heterochromatin-like structure adopted by 
regions surrounding the hyperexpanded GAA repeats, 
we determined the status of H4K20me3 at the FXN gene 
in FRDA and control cell lines (Figure 4, bottom panel). 
ChIP experiments showed that enrichment of H3K9me3 
was also associated with significant enrichment of 
H4K20me3 at the region upstream of the hyperexpanded 
GAA repeats in FRDA cells. These data further demon- 
strate that the region immediately upstream of the 
hyperexpanded GAA repeats adopts a heterochromatin- 
like conformation in FRDA cell lines. 
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Figure 4. The region upstream of the GAA repeats in FRDA cells is enriched with histone modifications associated with repressive chromatin 
structure. Chromatin structure at the FXN gene was defined by ChIP using antibodies specific to the histone modifications indicated in each graph. 
The experiment was conducted in triplicate using three FRDA and three control cell lines. Data are expressed as mean ± SEM. 



Nucleic Acids Research, 2011, Vol. 39, No. 19 8373 



Repressive chromatin structures, such as the inactive X 
chromosome and silenced inducible genes, are frequently 
associated with enrichment of H3K27me3 (32,33). 
Mapping of H3K27me3 in the FXN gene revealed an 
over representation in the region upstream of the GAA 
repeats in FRDA cell lines, co-localizing with H3K9me3 
and H4K20me3 enrichments. Taken together, our data 
indicate that the hyperexpanded tract of GAA repeats in 
the FXN gene is associated with a unique constellation of 
repressive histone modifications that lead to the formation 
of a heterochromatin-like conformation. 

Heterochromatin-like structure associated with the 
hyperexpanded GAA repeats is formed in the 
absence of transcription 

To investigate whether the alteration of histone modifica- 
tions at the FXN gene result from inherent characteristics 
of the FXN gene or from the hyperexpanded GAA 
repeats, we employed a GFP_GAA reporter system that 
harbors a single copy of the GFP reporter gene containing 
560 or 0 GAA repeats in the intron as previously described 
(9). The GFP_GAA system was established by inserting a 
hyperexpanded tract of GAA repeats, derived from a 
FRDA cell line, into the intron of a GFP reporter gene. 
GFP transcription, driven by a CMV promoter, is 
controlled by tetracycline repressor/operator. The 
GFP_(GAA-TTC) 560 and GFP_(GAA-TTC) 0 cells 
mimic FRDA and unaffected control cells, respectively. 
Increased length of the GAA tract up to 560 repeats 
leads to 2-fold decrease in the expression of the GFP 
reporter gene (9). To assess changes in chromatin caused 
by the insertion of long GAA repeats in the context of the 
GFP gene, we analyzed the status of histone modifications 
in the vicinity of the GAA repeats in the GFP GAA 
system (Figure 5). We confirmed lower levels of H3K9/ 
14ac and high levels of H3K9me3 in the proximity of the 
GAA repeats in GFP_(GAA • TTC) 56 o cells as previously 
demonstrated (9). We found that both H3K9/14ac and 
H3K9me3 enrichment extended up to the junction 
between exon 1 and the intron as well as the intron and 
exon 2 of the GFP gene in the GFP_(GAA-TTC) 560 cells. 
Next, we examined levels of H3K4me3 and H3K79me2 at 
the GFP gene. No significant changes in the levels of 
H3K4me3 were detected at the junction between exon 1 
and intron as well as in the region upstream of the repeats 
in the GFP_(GAA*TTC)56o cells relative to the construct 
lacking the GAA repeats. Levels of H3K4me3 were 
reduced in the region downstream of the GAA repeats 
in the GFP_(GAA • TTC) 560 cells. This distribution of 
H3K4me3 is similar to that found at the FXN gene in 
the FRDA cell lines. Additionally, we found that 
H3K79me2 was dramatically reduced at the region down- 
stream of the GAA repeats in the GFPJGAA • TTC) 560 
cells when compared to the corresponding region in the 
GFP_(GAA • TTC) 0 cells, indicating an impediment of 
transcription elongation. These results clearly demonstrate 
that the hyperexpanded GAA repeats not only induce epi- 
genetic changes in the surrounding chromatin independent 
of the DNA context, but also are sufficient to inhibit tran- 
scription elongation. Moreover, the landscape of histone 
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Figure 5. Long GAA repeats induce repressive chromatin conform- 
ation in the GFP reporter gene. The levels of histone modifications in 
the GFP_GAA reporter system, schematically shown in the top panel, 
were determined by ChIP assay. The black box represents CMV 
promoter; the white box designates tetracycline operator that allows 
regulating transcription initiation by tetracycline. The black lines cor- 
respond to the location of the PCR products amplified following ChIP. 
White bars represent the GFP_(GAA'TTC) 0 cells. Gray bars represent 
the GFP_(GAA-TTC) 560 cells. Exl/Int, junction between exon 1 and 
intron; Up, region upstream of the GAA repeats; Down, region down- 
stream of the GAA repeats; Intl/Ex2, junction between intron and 
exon2. Data are expressed as mean ± SEM. Asterisk indicates statis- 
tical significance (one: i><0.05, two: P<0.01, three: P< 0.001). The 
abundance of histone modifications is shown relative to input DNA in 
the PCR and normalized to a total H3 for each region. 



modifications in the GFP_(GAA • TTC) 560 cells is similar 
to that observed in FRDA cell lines. These facts strongly 
suggest that elongation, but not initiation, is the major 
step affected by the hyperexpanded GAA repeats in 
FRDA patients. 
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Hyperexpanded GAA repeats are capable of inhibiting 
the progression of RNA polymerases in vitro and in vivo 
(17,18). Moreover, transcription arrest is associated with 
changes in posttranslational histone modifications 
(34-36). To determine if RNA pol II arrest can trig- 
ger the cascade of silencing events, we used the 
GFP_GAA system to precisely control initiation of tran- 
scription using a tetracycline-regulated operator/repressor 
(Figure 6). We used GFPJGAA • TTC) 560 and 
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Figure 6. The tract of GAA repeats is sufficient to form the 
heterochromatin-like structure in the absence of transcription. (A) 
The expression of GFP mRNA was analyzed in the absence/presence 
of tetracycline using qRT-PCR. White bars represent results obtained 
from cell line lacking the GAA repeats in intron of the GFP gene while 
gray bars representing cells harboring intronic 560 GAAs. Data are 
expressed as mean ± SEM. The experiment was conducted in triplicate 
and P-values were calculated from the average of all three determin- 
ations for the GFP_(GAA-TTC) 560 and the GFP_(G AA • TTC) 0 cells. 
(B) Effect of transcription on the formation of the heterochromatin-like 
structure was analyzed using ChIP assay in the GFP_GAA system. 
Enrichment of histone modification at the region upstream (Up) and 
the region downstream (Down) were determined in the absence and 
presence of tetracycline (0.1 ug/ml for 24 h). 



GFP_(GAA - TTC)o cells that were sub-cultured for less 
than 10 passages prior to the experiment and had never 
been maintained in the presence of tetracycline. Prior to 
ChIP analyses the cells were cultured in parallel in the 
presence or absence of the tetracycline. Upon the inhib- 
ition of transcription, the levels of H3K9me3 were 
enriched in the regions flanking the GAA repeats in the 
GFP_(GAA-TTC) 560 cells. After induction of transcrip- 
tion by the tetracycline, levels of H3K9me3 were un- 
changed in the vicinity of the GAA repeats in the 
GFP_(GAA-TTC) 56 o cells, as compared to the non- 
tetracycline treated cells. However, levels of H3K79me2 
were significantly decreased at the region downstream of 
the GAA repeats in the GFPJGAA • TTC) 560 cells upon 
tetracycline treatment when compared to the cells cul- 
tured in the absence of the tetracycline. These results 
strongly suggest that the hyperexpanded GAA repeats 
are the underlying cause for the formation of the 
heterochromatin-like structure. The arrest of RNA pol 
II is unlikely to serve as a signal instigating epigenetic 
changes associated with long GAA repeats. Taken 
together, our results demonstrate that FXN deficiency in 
FRDA patients is a consequence of defective transcription 
elongation through the heterochromatin-like structure 
induced by the hyperexpanded GAA repeats. 



DISCUSSION 

In this study we demonstrated that the heterochromatin- 
like structure induced by the hyperexpanded GAA repeats 
leads to the inhibition of transcription elongation. 
Moreover, this phenomenon is independent of the 
sequence context of the hyperexpanded GAA repeats. 
The shift in histone modification patterns induced by 
GAA repeats is locally restricted to the regions upstream 
of the hyperexpanded GAA repeats at the FXN gene in 
FRDA cells and does not extend to the promoter region. 
Consequently, similar levels of the FXN pre-mRNA were 
detected upstream of the GAA repeats in FRDA and un- 
affected cell lines. On the contrary, a dramatic decrease of 
the FXN pre-mRNA was observed at the region down- 
stream of the hyperexpanded GAA repeats, indicating 
an impediment of FXN transcription elongation in 
FRDA cell lines. These results are corroborated by differ- 
ences in the histone modifications as well as RNA pol II 
occupancy throughout the FXN gene in FRDA cell lines. 
Similar to the mutated FXN gene in FRDA cells, intro- 
duction of the longer GAA repeats into a reporter gene 
results in formation of heterochromatin-like conformation 
and silencing of the transgene. The changes in the chro- 
matin conformation triggered by the hyperexpanded GAA 
repeats are independent of transcription, indicating that 
the hyperexpanded GAA repeat tract is the primary 
inducer of the heterochromatin-like structures that subse- 
quently inhibit the transcription process. 

Comparison of histone modifications and RNA pol II 
distribution at the FXN gene between control and FRDA 
cell lines provides a strong indication that FXN defi- 
ciency in FRDA cell lines originates at a post-initiation 
step of transcription. Several reports using genome-wide 
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approaches have suggested that post-initiation events are 
rate-limiting in the regulation of transcription (19,37). 
This phenomenon was first described for a heat shock — 
inducible gene, Hsp70, in Drosophila (38). High levels of 
RNA pol II were detected at the promoter of the 
uninduced Hsp70 gene. The paused RNA pol II was 
released by heat shock stimuli, which resulted in activation 
of Hsp70 transcription (38). The increased level of RNA 
pol II in the promoter-proximal region correlates with the 
enrichment of active chromatin marks such as H3K9/14ac 
and H3K4me3 at pause sites in the human genome (19). In 
our analyses, pausing of RNA pol II at the promoter- 
proximal region could be readily detected in control cell 
lines, suggesting that this is part of physiological regula- 
tion of the FXN gene expression. Although the occupancy 
of RNA pol II at the promoter region is similar in un- 
affected and FRDA cell lines, pausing of RNA pol II is 
absent at the promoter-proximal region in the FXN gene 
containing the hyperexpanded GAAs. Pausing of RNA 
pol II may facilitate remodeling of the chromatin, as 
shown in our results, to stimulate further transcription 
progression, as well as stabilizing the binding of transcrip- 
tion machinery facilitating subsequent rounds of tran- 
scription initiation. Indeed, new FXN mRNA is slowly 
synthesized in FRDA cell lines compared to unaffected 
cell lines (21). The depleted RNA pol II in the coding 
region of the genes is associated with low levels of 
H3K4me2, H3K4me3, hyper-methylated DNAs at CpG 
residues, and hyper-methylated histone H3 at lysine 9 
(22,23). As expected, we detected low levels of total 
RNA pol II in the region upstream of the GAA repeats 
in FRDA cells. Moreover, the region immediately 
upstream of the GAA repeats exhibits low levels of 
H3K4me3 and H3K4me2 in FRDA cells as compared to 
unaffected cells. H3K4me3 in the coding region is 
associated with the efficiency of the post-initiation step 
in transcription (22,23) and acts as an anchor to recruit 
chromatin-modifying complexes (26,39). Thus, the signifi- 
cant decrease of H3K4me3 at the region upstream of the 
GAA repeats in FRDA cells may impair the recruitment 
of chromatin-modifying complexes that are necessary to 
produce full-length transcripts, which subsequently results 
in the impediment of FXN transcription elongation in 
FRDA patients. 

Regulatory elements located in the first intron effect 
expression of several genes (40,41). Studies using reporter 
constructs have demonstrated that deletions of fragments 
of the FXN intron 1 significantly decrease luciferase 
activity (24). Results of our ChlP experiments show the 
most pronounced difference in the chromatin landscape 
between FRDA and control cell lines exists in the intronic 
region upstream of the hyperexpanded GAA repeats. This 
region is also preferentially hyper-methylated in FRDA 
patients (24) and the levels of DNA methylation correl- 
ate with the number of the GAA repeats (25). One of 
the methylated CpG residues at the region upstream of 
the GAA repeats is embedded in an E-box motif 
(CANNTG) that is recognized by bHLH transcription 
factors. Analyses of global distribution of the c-Myc, the 
bHLH transcription factor, demonstrated that this protein 
interacts with the E-box located in the region upstream of 



the GAA repeats (42). Moreover, the induction of c-Myc 
has been shown to upregulate FXN transcription (43). 
Myc proteins have been demonstrated to stimulate gene 
expression via release of the paused RNA pol II (28) or by 
interaction with histone modifying proteins such as 
histone acetyltransferases (44,45). In contrast to FRDA 
cells, the region upstream of the GAA repeats encompass- 
ing the E-box motif in unaffected cells is enriched in 
H3K4me3, H3K4me2 and H3K79me2 histone modifica- 
tions, potentially facilitating the binding of myc proteins. 
Hypermethylated intronic regions (including the E-box 
CpG) in FRDA patient cells, perhaps affect the inter- 
actions between the E-box sequence and myc proteins 
(46,47). Taken together these data emphasize the import- 
ance of the region upstream of the repeats as the regula- 
tory element of FXN expression. It is likely that long GAA 
repeats induced epigenetic changes in the first intron of the 
FXN gene and interfere with transcriptional enhancers 
leading to the reduced progression of transcriptional ma- 
chinery and consequently to the silencing of FXN expres- 
sion in FRDA. 

Histone methylation status represents both an import- 
ant regulatory element of gene expression and an indicator 
of the transcriptional status of a gene. Enzymes that add 
or remove methyl groups on specific sites of histones 
regulate the balance between methylation and demethyla- 
tion in the cell. The activity of a large group of histone 
demethylases containing the Jumonji C-terminal domain 
is dependent on the presence of non-heme iron (48). 
Several reports demonstrate that low expression of 
FXN, an iron chaperon, results in the accumulation of 
iron in the mitochondria and the depletion of iron in 
other cellular compartments leading to a severe imbalance 
of iron in FRDA cells (2,3). This suggests that the con- 
centration of iron is associated with the activity of an 
iron-dependent demethylase in FRDA patients. In the 
FRDA cells when compared to controls, we detected sig- 
nificant changes in various histone methylation marks 
including H3K27me3 that can potentially be affected by 
iron-dependent histone demethylases. In fact, overexpres- 
sion of iron dependent UTX demethylase is responsible 
for H3K27me3 demethylation and has been demonstrated 
to increase expression of FXN mRNA in KYSE180 
cancer cells (49). Additionally, FXN deficiency has been 
shown to downregulate expression of genes involved in 
DNA packaging and nucleosome assembly (50). 
Although, expansion of the GAA repeats is the primary 
factor driving chromatin changes, it is possible that meta- 
bolic changes resulting from FXN deficiency in FRDA 
cells contribute to the epigenetic landscape at the FXN 
gene. Global analyses of nucleosome position and 
histone modifications in FRDA are necessary to reveal 
the full spectrum of epigenetic consequences attributed 
to reduced FXN levels. 

Dissecting the mechanism of transcriptional silencing 
induced by the hyperexpanded GAA repeats is crucial 
for the rational design of therapeutic strategies aimed to 
alleviate transcriptional blocks imposed by the repeat ex- 
pansions. Currently, the most promising small molecules 
enhancing transcription of the FXN gene are HDAC in- 
hibitors. HDAC inhibitors target the epigenetic 
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consequences of the GAA expansions and alter the chro- 
matin status in the vicinity of the repeat tract without 
affecting the chromatin landscape at the promoter 
region of the FXN gene (10). However, the initial event 
or signal instigating the cascade of epigenetic changes in 
the GAA repeat region remains an open question. Perhaps 
the hyperexpanded GAA sequences or non-canonical 
DNA structures transiently formed by the hyperexpanded 
GAA tracts recruit specific proteins that initiate modifica- 
tions in the chromatin environment surrounding the 
repeats. Additionally, antisense transcription at the FXN 
locus (FAST-1) can be involved in triggering silencing of 
the FXN locus (11). Solving these questions will bring us 
significantly closer to developing an effective treatment for 
FRDA and other diseases associated with repeat expan- 
sion and gene silencing. 
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